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In this paper we discuss the problem of the estimation of extreme event occurrence probability for 
data drawn from some multifractal process. We also study the heavy (power-law) tail behavior of 
probability density function associated with such data. We show that because of strong correlations, 
standard extreme value approach is not valid and classical tail exponent estimators should be 
interpreted cautiously. Extreme statistics associated with multifractal random processes turn out 
to be characterized by non self-averaging properties. Our considerations rely upon some analogy 
between random multiplicative cascades and the physics of disordered systems and also on recent 
mathematical results about the so-called multifractal formalism. Applied to financial time series, 
our findings allow us to propose an unified framemork that accounts for the observed multiscaling 
properties of return fluctuations, the volatility clustering phenomenon and the observed "inverse 
cubic law" of the return pdf tails. 

PACS numbers: 0.250.-r, 05.40.-a, 05.45.-a, 89.65.Gh 



*Electronic address: muzy@univ-corse.fr 

^Electronic address: emmanuel.bacry@polytechnique.fr 

^-Electronic address: alexey@cmapx.polytechnique.fr 



1 



I. INTRODUCTION 



Statistics of extremes is an issue of prime importance in many situations where extreme 
events may appear to have disastrous effects or to govern the main observations. Such 
situations can be found in a wide range of fields from physics (e.g. disordered systems at 
low temperature), geology (earthquakes), meteorology (rainfalls, storms), insurance or fi- 
nance |1, . Extreme events are particularily relevant for random phenomena involving 
a probability density function (pdf) which tails decrease very slowly and roughly follow a 
power-law. Such heavy tailed distributions have been observed in many natural phenomena. 
An important question concerns therefore the estimation and interpretation of pdf tail ex- 
ponent as well as the identification of mechanisms leading to them. These problems are at 
the heart of a, increasing .tutaber of works flflfl. Probabilistic and statistical questions 
related to very high or very low values of random variables are addressed within the frame- 
work of extreme value theory. This theory has been originally developed for independent 
identically distributed (i.i.d.) random variables and more recently extended to stationary 
processes where independence condition has been relaxed Q]. However, when correlations 
are not weak enough very few results are known. 

In this paper we aim at studying the statistics of extreme events and the (fat) tail ex- 
ponent of fluctuations associated with multifractal random processes. Mutifractal extreme 
fluctuations are interesting because they represent an example of strongly correlated ran- 
dom variables that do not satisfy standard mixing conditions of extreme value theory. But 
multifractals are also interesting because they are widely used to model of self-similar phe- 
nomena displaying multiscaling properties. Our purpose is to study probability of extreme 
event of multifractal processes and the associated power-law tail exponent. We will show 
that for such processes, the pdf tail exponent value observed (estimated) from experimental 
data may be different from the value associated with the unconditional theoretical pdf. We 
examine different experimental conditions depending upon the size of the observed sample 
L, the correlation length T and the observation scale r. We emphasize that, under most 
usual conditions, the estimated tail exponent is smaller than the exponent one would expect 
without correlations. This result is intimately related to some non-self averaging property 
of usual tail exponent estimators and is the analog of the glassy behavior observed at low 
temperatures in disordered systems. We apply our phenomenological framework to multi- 
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fractal models of asset return fluctuations and show that the well known "inverse cubic law" 
of pdf tails can be naturally explained in terms of volatility correlations. 

The paper is organized as follows: In section |H1 we briefly review the main results about 
extreme value theory and the commonly used tail exponent estimators. Mult if r act al pro- 
cesses, multiplicative cascades and their main mathemetical properties are reviewed in sec- 
tion IIHI In sections IIVI we build an extreme value theory for multifractal cascades. The 
problem of the estimation of the power-law tail exponent associated with the cumulative 
probability distribution of multifractal fluctuations is addressed in section |V] In section IVII 
we illustrate our phenomenology by numerical examples of continuous as well as discrete 
cascades. Application to finance is considered in section IV11I while section IV111I contains 
concluding remarks and questions for future research. Auxiliary computations or technical 
material are reported in Appendices. 

II. FAT TAILS AND EXTREME VALUE STATISTICS 

Let us briefly review the main estimators used to characterize the power-law tail behav- 
ior of some probability distribution. Let F(x) = P [Z < x] be the cumulative probability 
distribution (cdf) of some random variable Z. The variable Z is said to be of power-law 
type tail or Pareto type tail if, when x — > +00, 



where C is a positive normalization constant or a slowly varying function. The exponent fi is 
called the tail exponent of the distribution. The problem addressed in this section concerns 
the estimation of this exponent from empirical data. 

A simple, widely used method relies on the so-called "Zipf" or "rank-frequency" plots 
(see e.g. j^l, Q, li| ) : Let Z\ . . . Z N be N i.i.d. samples characterized by the same distibution 
function F(x). Let us denote X\ > X 2 > . . . > Xn the rank ordered values of Zi (sorted in 
descending order). Then, if the asymptotic behavior of F(x) is Pareto like (as in Eq. (fljl). 
one has, for 1 < j < k <C N: 



and therefore /1 can be simply estimated as the slope of the Zipf plot [In Xj, ln(j)], j = 1 . . . k 



1 - F(x) ~ Cx~^ 



(1) 
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see j6[ for exact results). In the following, we refer to this estimator as the power-law fit 



estimator. Since this estimator is biased 



one should use alternative estimators. 



Actually, there are many alternative tail exponent estimators Q|. The most commonly 
used are Hill or Pickands estimators that are defined as follows: Let k(N) = o(N) be the 
maximum X rank used to estimate /x, the Hill estimator is simply 

Mk,N)= J'* (3) 

Ei=i H x i/ X k) 



while the Pickands estimator is: 



ill I tt 4 zr^- I 



The mathematical study of these gators (consistency, bias, asymptotic normality) 
relies upon Extreme Value Theory |1J,|2||, i-e., the theory that deals with maxima and min- 
ima properties of random variables. According to this theory, the maximum value Y of 
N i.i.d. random variables (normalized properly), has a probability density function that 
asymptotically belongs to the Fisher-Tippett's Extreme Value distribution class. According 
the the shape of the pdf of Y, the pdf of the maximum can either be of the Frechet type 
(Fy(x) = e~ x M ), of the Gumbel type (F Y {x) = e~ e x ) or of the Weibull class 

Extreme value theory has also been extended to dependent (or correlated) stationary 
random processes [jj. Under mixing conditions ensuring asymptotic independence of max- 
ima, it can be shown that the limit theorems established in the i.i.d. case still hold. The 
key difference is that the number N of independent variables is replaced by an "effective" 
number NO. The value < 6 < 1 that quantifies the effect of dependence is called the 
extremal index In the case of Gaussian processes, these theorems hold provided the 
covariance function p(x) decreases sufficiently fast for large lags x, i.e. 

p(x)ln(x) -> (5) 



x— >+oo 



A simple intuitive justification of l/ln(x) as the limiting case for the validity of standard 
extreme value theorems is provided in ref. []|. 

The main purpose of this paper is to try to understand how these theoretical results 
apply to data sampled from a multifractal process. 
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III. MULTIFRACTAL PROCESSES 



Multifractal processes are random functions that possess non trivial scaling properties. 
They are now widely used models in many areas of applied and fundamental fields. Well 
known examples are turbulence, internet traffic, rainfall distributions or finance. For the 
sake of simplicity we will consider only non- decreasing multifractal processes (often referred 
to as multifractal measures though their variations are not bounded) denoted hereafter M(t). 
More general multifractal processes can be conveniently builded as a simple Brownian motion 



B(t) compounded with the measure M(t) considered as a stochastic time: X{t) = B 
The statistical properties of X{t) can be directly deduced from those of M(t) (see e.g., 



A. Multiscaling 

Multifractal processes are characterized by multiscaling properties of their variations. 
More precisely, if one defines the increments of M(t) at scale r, M(t, r) = M (t + r) — M(t), 
multifractality can be loosely defined from the scaling behavior of the moments of M(t, r): 

E[M(t,r) q )] ~ r^ q) (6) 

where q G R is the order of the moment and the exponent ((q) is some nonlinear convex 
function often called the multifractal exponent spectrum. The simplest example of such 
function is the so-called log-normal spectrum for which ((q) is a simple parabola: 

C(?) = (1 + y)fl - W 

The coefficient A 2 quantifies the curvature of ((q) (and hence the multifractality of the 
process) that is constant in the log-normal case. In the general case, one often calls — ("(0) 
the intermittency coefficient. Let us notice that, because of Holder inequality for moments 
the scaling with a non linear convex ((q) cannot hold at arbitrary large scales r but is 
valid only in a domain bounded by some large scale T (actually the limit r — > in Eq. (JSJ) 
must be understood as r/T 1). This scale T will be called the integral scale. 



B. Singularity spectrum and multifractal formalism 



Let us recall some classical results about the multifractal forma" 
been introduced in early eighties by Parisi and Frisch (see e.g. 10 



ism. This formalism has 
lll |) in order to interpret 



the above multiscaling properties of the moments in terms of pointwise regularity properties 
of the paths of the process M(t). Let us define the local Holder exponent a (to) at point (or 
time) t Q as 

M(t Q ,r) ~ r a( * o) (8) 

The limit r — > means t ^ T where T is the integral scale. The singularity spectrum 
f*((x) can be introduced as the fractal (Haussdorf or packing) dimension of the iso-H61der 
exponents sets: 

f*(a) = Dim{t, a(t) = a} (9) 

Roughly speaking, this equation means that at scales r<T, the number of points where 
M(t, t) ~ r a is 

N(r,a) ~ r- f ^ a) . (10) 

The multifractal formalism states that f*(a) and ((q) as defined in Eq. @ are basically 
Legendre transform one to each other. More precisely, if we define f{a) as the Legendre 
transform of ({q), i.e., 

f(a) = l + min(ga-C(g)) 

i 

C(q) = l + min(ga-/(a)), 

a 

then 

= f(a), Va* > a > a*, 

where a* and a* are defined by 

a* = inf{a, f*(a) = 0} 
a* = sup{a;, f*(a) = 0}. 

In the following sections, we will use the fact that q can be interpreted as a value of the 
derivative of f(u) and conversely a is a value of the derivative of ((q) : for a given value of 
q = q one has, from previous Legendre transform relationship and thanks to the convexity 

of CO?): 

f(a ) = 1 + q ao ~ C(<7o) (11) 
«o = f g (qo) (12) 

qo = £ (ao) ' (13) 



Let us note that, f*(a) can be seen as the Legendre transform of the function (*(q) simply 
defined as 

((q) for q* < q < q± 

a+q for q > q+, , (14) 
a*q for q < q* , 



where 



da 
df 



[a* 



^ da 



It is important to point out that, experimental 
not C(q)) can be estimated (see e.g. refs. jl^. \j" 



(15) 
(16) 

under usual conditions, only (*(q) (and 



Cascades 



The paradigm of multifractal measures are multiplicative cascades originally introduced 



to discrete random cascades 



)t 


1 




fi, 


i 




u 


,: 
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a lot of mathematical studies have been devoted 



2l|. Very recently, continuous versions of these 



cascades have been defined: they share exact multifractal scaling with discrete cascades but 
they display continuous scaling and possess stationary increments B, S Q Q Let us 
summary the main properties of these constructions and set some notations. 

The simplest discrete multifractal cascade can be constructed as follows: one starts with 
an interval of length T where the measure is uniformly spread, and split the interval in two 
equal parts: On each part, the density is multiplied by (positive) i.i.d. random factors W. 
Each of the two sub-intervals is again cut in two equal parts and the process is repeated 
infinitely. At construction step n, if one addresses a dyadic interval of length T2~ n by a 
kneading sequence k± . . . k n , with ki = 0, 1, the "mass" of this interval (denoted as Ikx...k n ) 
is simply: 



M n (I kl ...k n ] 



En 
, i=i w fei.. 



i=l 



(17) 
Q 



where all the W^...^ — e Uk ^- k i are i.i.d such that E [W] = 1. Peyriere and Kahane 
proved that this construction converges almost surely towards a stochastic non decreasing 
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process provided E [Whi W] < 1. The multifractality of M M (hereafter simply denoted 
as M) and the validity of t he p reviously described multifractal formalism have been studied 
by many authors (see e.g. 12]). An interesting additional property of cascades is that they 
are self-similar in the following stochastic sense: 

M[I kl ... kn ] = 2- 1 WM[I kl ... kn _ 1 ] (18) 

law 

and therefore the order q moments of M(t, r) behave as a power-law: 

E [M[0, T2~ n ] q ] = 2- nq E [W q ] n E [M[0, T] q ] (19) 

Comparison of Eqs. f)19|) and © with r = T2~ n directly yields the expression of the 
spectrum ((q) in terms of cumulant generating function of u = \nW: 

C(g) = g-ln 2 (E[W"]) (20) 

In that sense, ((q) is nothing but a large deviation spectrum. Let us mention that the 
validity of the multifractal formalism has been rigorously proved for cascades. In Appendix 



El we provide explicit expressions O i 

Let us note that, as shown in 
decreases slowly as 



C(q) for various laws of W. 



24, 



2fij |. the correlation function p(x) of such cascades 



p(x) ~ ln(T/x) for lags x < T. (21) 

If we consider that the data come from a sampling of successive independent cascades, the 
correlation function is for lags above T, i.e., 

p{x) = , x > T. (22) 

The integral scale T where cascading process "starts" can therefore be interpreted as a 
correlation length for the variations of the M(t). 

Because the previous construction involves dyadic intervals, and a 'top-bottom' con- 
struction, it is far from being stationary. In order to get rid of this drawback, as already 
mentionned, some continuous cascade constructions have been recently proposed and stud- 
ied on a mathematical ground Without entering into details, we just want 
to mention that such continuous cascades involve a family of infinitely divisible random 
processes u>i(t) which correlation function basically follows Eqs (j2*Tj) and (J2~2")l . The process 
e ^i{t) j s fa e analog of the density satisfying the self-similarity: 

e ^ s i{st) _ e n ae wi(t) 

law 



Martingale theory allows one to prove the convergence of the continuous process M[0,£] = 
lim^o £ eUJ ' {u) du - 

Let us mention that the validity of the multifractal formalism has been established by 
Molchan b y for discrete cascades Q and ge— by Barral aed Mandelbrot for 
continuous cascades 2^|. 

In the sequel, we will be using indifferently the classical top-bottom model (i.e., we will 
consider that the data come from a sampling of a succession of independent realizations of 
the same cascade process) and the continuous cascade model (i.e., we will consider that the 
data simply come from a sampling of a continuous cascade). Most of the arguments will 
be done using the first model while numerical examples will be performed on data of the 
second model. 



D. Cascades have fat tails 



Let us first emphasize that the unconditional law of M can have a power law tail. Indeed, 
a simple argument involving the self-similarity of the limit measure allows one to obtain a 
simple bound: since the measure is additive, M[0, 1] = M[0, 1/2] + M[l/2, 1], for q > 1 one 
gets 

E [M[0, l] q ] > 2E [M[0, 1 /2} q ] = 2 1 ~ C(,?) E [M[0, l] q ] (23) 

and finally E [M[0, l] q ] < +oo =^> ((q) > 1. The reverse implication is basically true. It is 
however trickier to obtain and we refer to |Lj| for a precise proof. 

Since the power law tail exponent of a distribution is directly related the the maximum 
order finite moments, if one defines 

fi= l + sup{g,g> l,C(g) > 1} , (24) 

then , cascades (discrete as well as continuous) have thick tails of exponent \x: 

¥[M(t,r)>x] ~ x-» . (25) 

E. Defining the asymptotic limit N — > +oo 

As we have seen (Eq. (|21|)). the covariance function p{x) for multifractal processes has 
a very slow decay (slower than Eq. (JSJ) up to the integral scale T above which data are 
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independent. Thus we expect that mixing conditions are not valid in the range from the 
sampling scale r to the integral scale T while they hold when "looking" above scale T. Let 
us try to be more precise: the mixing conditions are conditions in the limit when the total 
number of samples N goes to +00. However, there are several ways to reach this asymptotic 
limit. Let L be the length of the whole sequence and r the sampling scale. The total number 
of samples is therefore, 




while the number of integral scales and the number of samples per integral scales N T are 
respectively: 

N T = ^ , N T = -, N = N T N T . (26) 

1 T 

In order to control the relative values of Nt and N T , let us define the exponent x as follows: 



N T ~ N* (27) 

Let us note that this exponent \ has been already introduced by B.B. Mandelbrot as an 

261 ] in order to discuss the concept of negative dimension (see 



"embedding dimension" 
below). 

Thus, when iV — > +00, if, for instance, x = 0, it means that we are in the case r — > 
(while L and T are fixed), i.e., most of the data are lying between the lags r and T. 
Consequently, we do not expect the mixing conditions to hold. On the contrary, if x — +00, 
it means that we are in the case L — > +00 (while r and T are fixed), i.e, most of the data 
are lying between the lags T and L and consequently the mixing conditions are satisfied. 

Thus, as it will be discussed in the next two sections, in the first case (x = 0), nothing 
guarantees that classical results of extreme value theory can be applied and that exponent 
tail estimators provide the expected values, whereas in the second case (x = +00), we expect 
the i.i.d. extreme value theorems to hold and the exponent tail estimators to be consistent. 
As we will see, one can go continuously from the first case to the second one. Actually 
we will show that both the extreme value distribution associated with cascades and the 
corresponding tail exponent estimator strongly depend on the value of x- 
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IV. MULTIFRACTAL EXTREME VALUE STATISTICS 



A. Cumulative probability distribution of the maximum 

Let t = T2~ n . We call Xi(N) the maximum value of ln(M(/ n )/r), where I n is a short 
notation of the dyadic intervals Iki...k n °f s i ze r - Let P(x,N) = ¥[Xi(N) < x] be the 
cumulative distribution function (cdf) of Xi(N), i.e., the probability that Xi(N) is smaller 
than x. 

Let us recall that we consider that the data come from a sampling of successive indepen- 
dent realizations of the same cascade measure. Thus T (the integral scale) is fixed whereas 
L (the total length of the data) and r (the sampling scale) are varying. We want to study 
the statistics P(x, N) of the data in the limit iV — > +00. As we explained in the previous 
section, these statistics will strongly depend on \ (Eq. (j27jl ). i.e., on the way iV T and Nt go 
to +00. We fix x — r /p an d we choose the following parametrization: 

N T ~ 2 pm , N T ~ 2 rm , N = N T N T ~ 2 (p+r)m , 

when the integer parameter m — > +00. 

Thus P(x, m) = P(a;, N(m)) is the cdf of In M(J pm )/r where r = T2~ pm 
In Appendix |H1 we show that P(x, m) can be simply expressed as: 

P{x,m) = (P'(x,pm)) (28) 

where P'(x,n) is the cdf associated with the maximum of ln(M(J n )/r) on a single integral 
scale instead of the whole data. It satisfies the renormalization equation: 

P'{x,n+ 1) = lP'(x,n)*g(x)} 2 (29) 

where * stands for the convolution product and g(x) is the probability density of to = \n(W). 

Let us notice that the initial condition P(x, 0) is precisely given by the law of ln(M ) 
which exponential tail is described by Eq. (|2~5j) : 

P(x,0) ~ l-Ce - "* (30) 
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B. Traveling front solutions 



1. Case x = +o° 

When r = 1 and p = 0, Eq. (J28|) is simply the recurrence for the maximum cdf of i.i.d. 
random variables which solutions are Fisher-Tippett's fixed points reviewed in sec. |n] More 
precisely, since the initial condition is exponentially descreasing (Eq. (|3U|)). when N — > +oo, 
P(x, N) will have a Gumbel shape. Consequently, the law of the maximum value of M will 
belong to the Frechet class with a tail exponent \x as defined in Eq. (|24|) . 



2. Eq. W^l and the KPP equation 



In order to solve the nonlinear problem ()28)1 in the general case, let us first study Eq. 
fl29|) and its solutions. We will show that these solutions are traveling fronts so that, in some 
moving frame, Eq. (J28j) reduces to a standard (i.i.d.) extreme value problem. 

Equation (|2*9j) is exactly satisfied by the cdf associated with the maximum value of random 



variables hierachically correlatec 



tion has been studied in ref. 
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generated additively along a Cayley tree). Such an equa- 
28]. In ref. Q, the authors have considered a log- normal 



random process with log-correlated covariance, that can be considered as a continuous ver- 
sion of a random cascade. In this case, the obtained partial differential equation for the 
law of the maximum turns out to be the famous Kolmogorov-Petrovsky-Piscounov (KPP) 
equation. Thus Eq. (jUIj), in the case where g(x) is Gaussian, can be seen as a "discretized" 
version of the KPP equation. 

It is well know that the KPP equation has traveling wavefront solutions connecting the 
homegeneous stable state to the unstable one. These solutions can be studied using linear 
stability analysis. As emphasized in most of KPP features are somehow universal in 
the sense that the same analysis can be performed for a wide variety of "reaction-diffusion" 
problems involving non linear integro-differential or integro-diffence equations. In particular 
the famous marginal stability criterium (see e.g. refs. 29l I30L l3l| and below) for the selected 



front velocity can be generically applied for a large class of equations. Therefore, as explained 
in j^J 0, one can apply the same techniques to get solutions of Eq. (J2HJ). Though 
these studies do not rely on a fully rigorous mathematical ground (as KPP), we reproduce 
a similar analysis in the following. 
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Let us first notice that both Eqs (|28|) and Eqs (|29|) admit two homogeneous solutions 
P = and P = 1, the first one being (linearly) stable while the second one is unstable. 
A given cdf P(x,N) (or P'(x,n)) will therefore connect the stable state to the unstable 
one. As in the above cited references, one can consider a traveling front solution of (|29j) 
P'( x, n) — Pj. w (x — v'n) where v' is the front velocity. In order to compute this velocity, one 
performs a linear analysis in the vicinity of the unstable solution, i.e., in the limit x — > +00 
where P{ w (x — v'n) — > 1. 

If we denote 

Q' tw (x) = 1 - Pl w {x) « 1 
then, to the first order in Q' tw (x), Eq. (J2l?j) becomes : 

Q' tw (x - v') = 2Q' tw (x) *g{x) + (QfJ 

If one seeks for exponential solutions: 

Q' tw {x) = Ce-«* ; 

then, 

P/Jx, n) = 1 - Ce^ x ~ v '^ , (31) 
where v' and q satisfy the "dispersion" relation: 

v '(q) =q- 1 \n 2 / g{x)e qx dx 

Notice that g(x) is the law of the logarithms of the weights of the cascade construction, and 
then, from Eq. (20)) 

ln(2 J g{x)e qx dx) = ln(2) (1 + q - ((q)) 

which yields: 

A,) = h ' (2)(1+ ;- CM) (32) 

In the next section, one shall see which velocity (or which q value) is selected 
3. Case % 7^ +00 

From Eqs. ()28j) and (|3ip. one gets the following traveling front solution for P, 

P(x,m) = (1 - Ce- q{x - v ' {q)pm) f rm , 
13 



where v'(q) is given by 

If y — x — v'(q)pm — g -1 r ln(2)m, the cdf of y converges to the Gumbel shape: 



P(y,m) = (1 -Ce- qy ~ rl ^ 2)m f rm — > e 



and thus m) is itself a traveling Gumbel front: 

P(x,m)=e- Ce - qix - mvM) (33) 

with the dispersion relation: 

v{q) = ln(2) (r+p(l + g-C( g ))) (34) 
One can reproduce the same kind of analysis as in refs Q,Q,QHQ- 

In Appendix 

O we provide a sketch of proof that one can use a standard Aronson- Weinberger stability 
criterium to compute the velocity and q value which are selected: Let q m i n be the unique 
positive q value such that 

v{q min ) = mmv(q) (35) 

g>0 

and let 

g* iX = mm(fi,q min ). (36) 

Then the selected velocity is simply v(q+ tX ) and the shape of the traveling front is (up to a 
subdominant correction described in Appendix IH|) 

P(x, m) = e 

Let us recall (see section llV Ajl that P(x,m) corresponds to the cdf of In M(J n )/r where 
r = T2~ n and n = pm. Let us go back to InM. 



4. From In M (I n ) /r to In M(I n ) 

The velocity v(q) is a velocity related to the parameter m, i.e., P(x, m) = P{x — v(q)m). 
It is convenient to compute the velocity v T (q) as respect to the "observable" scale ln(T/r), 
i.e., P(x,m) = P(x — v T (q) ln(T/r)). Clearly, v T (q) = v (q)m/ ln(T/r). Since r = T2 _ra and 
n = pm, one gets f T (g) = v(q)/pln(2). Now, if we switch from the cdf of In M(J n )/r to the 
cdf of In M(J n ), one finally obtains the velocity: 

v lnM (q) = v{q)/pH2) - 1 = - C(g) ~ 1 ~ X 
14 



Let us note that the minimum of t>i n M(<?) (i-e., the minimum of v(q)) is reached for q min 
satisfying 

A x ' H ^ SVimin)- 

One can also notice that 1 + q^j^- — ((q) is the Legendre transform of C(o)- We know from 
section UlI Bl that this Legendre transform is nothing but the spectrum f(a). According to 
the results of this section (and particularly Eq. (JHJ) it can be easily seen that this value of 
qmin corresponds to a singularity exponent a min with 

f{0imin) = ~X (37) 
qmin ~J {otmin) (38) 

da 

Otmin = VlnM(qmin)- (39) 

Moreover, since f(a m i n ) < then df(a m i n )/da > and consequently q m i n > 0. Thus, this 
value of q min does correspond to the one defined in Eq. (J33J). 

We finally established that the law of the maximum of M at scale r of a sample of length 
r~ x is Frechet when r — >• with a tail exponent g* iX that depends on x- 

q^ x = mm(q min ,n) 

qmin ~^^{c^min} (^0) 

Let us remark that for x = 0, the value of g* iX is exactly the value q* that is involved 
in the multifractal formalism as described at the end of section 1111 Bl (Ec 



. dnj)). For finite 

X value, as already noticed more than a decade ago by Mandelbrot |26j], the exponent x 
that governs the size of the "supersample" allows one to explore negative dimensions (i.e., 
negative values of f{a)). 

Numerical evidence of the so-obtained results are reported in section IV1 Al 
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V. ORDER STATISTICS, TAIL EXPONENT ESTIMATION AND MULTIFRAC- 
TAL FORMALISM 



A. Notations 



The idea underlying tail exponent estimation is to study statistics of the k(N) observed 
extreme values in the asymptotic regime N — > +00 and k(N) — ► +00. As in the previous 
section, the limit N — > +00 is taken as explained in section UlI El using the x exponent (see 
Eqs (J2ED and (|27jl ). 

For the k(N) — > +00 limit, it is convenient to parametrize k(N) as: 

k(N) ~ iV" (41) 

where < f < 1, the value = being interpreted as fc(iV) ~ ln(iV). We will denote x) 
the estimated tail exponent using one of the estimators reviewed in section |H]for some given 
X and v (x is defined in the section UlIEj) . At scale r = T2~ n , as in section ITT| we denote 
Xi . . . X N the rank ordered values of M(J n ) over dyadic intervals. 



B. Tail exponent estimators 

Let us compute the expected value of the tail exponent estimator (Power-law fit, Pickands 
or Hill estimator) for fixed values of v and x- F° r the sake of simplicity we focus on 
Pickands estimator but the same argument equally applies to other estimators (e.g. Hill). 
Our heuristics will rely upon the multifractal formalism. 

As recalled in section IIIIB1 the positive part of the Legendre transform of ((q) corre- 
sponds to the singularity spectrum, i.e., the Haussdorf dimension of sets of iso-regularity. 
Mandelbrot has proposed a long time ago, a probabilistic interpretation of negative dimen- 
sions (i.e., negative values of f{a)) in terms of large deviation spectrum 2f|. In this section, 
we will use this formalism in order to study the order statistics of multifractal fluctuations. 

The Pickands estimator is simply defined as: 

-1 



£(^x)=ln(2)(ln ^ X * )~ (42) 



Let us define a u>x such that 



k = N u ~ Nt 
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Using the definition of x according to which Nt ~ (r/T) x , it follows that a U)X satisfies 

f(a VtX ) = v- X {l-i') (43) 

Let us now consider a' = a va + e\ such that 

r\-/K,x) 



2k ~ N T (r 

i.e., 2fc ~ Nt (i)-^*^ 1 *"*, thus 2" 1 /^ ~ where 

= ^K>x) • ( 44 ) 
Along the same line if a" = a„ iX + e 2 such that 

(T \ -f«x) 

4k ~ iV T (-) 

we have ~ (fO^- Thanks to the fact that, 

X k -X 2k ~ X fc (l-(r/T) ei ) 
X 2fc - X 4fc ~ X fc ((r/T) ei - (r/T) 62 ) 

one finally gets for expression (J42j) : 

/ 1 _ 2- 1 /^,x \ _1 
X) * ln ( 2 ) 2 -i/ gy , x _ 4-i/ g „ x J = ft« (45) 

We then come to the conclusion that the Pickands tail estimator for a multifractal process 
strongly depends on the choice of the rank k = N u and the exponent \. This is another 
difference with standard theory for i.i.d. random variables. The same kind of phenomenology 
can also be applied to the Power-law fit or Hill estimator. Let us notice that when v = we 
consider only the 'extreme' of the 'extremes' for the tail exponent estimation and therefore 
we recover the tail of the law of the maximum value as discussed previously. Indeed, from 
Eqs. flUD, dSJ and (g3J), we get 

A(0,x) = f(ao, x ) 
f( a o, x ) = -X , 

that is, exactly Eqs. (JJUJ). 

Numerical evidence illustrating these results are provided in the next section. 
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FIG. 1: Cumulative probability distribution of the maximum of ln(M) at scale r = T2~ n , P(x, n) 
(also referred to as P(x, r)), as a function of x for various values of r for a continuous log-normal 
cascade with A 2 = 0.2. (a) Nt = 1 and h^T/r) = 3 (continuous line), 4, 5, 6 (dotted lines). 
According to the formalism of section IIV1 the cdf should be a front moving toward x < at a 
"velocity" a* iX= o = a* — 0.47. (b) All the cdf of Fig. (a) merge to a single curve when plotted in 
the "moving referential", (c) Same plot as Fig. (a) but with x = 1- One expects a smaller velocity 
a*,x=i ~ 0.2. (d) Same plot as figs, (a) and (c) for scales h^T/V) = 3,4,5 with x = 3. One 
observes, as expected, a negative velocity a*, x =3 — —0.16. 

VI. NUMERICAL EXAMPLES 



Most of our considerations rely on phenomelogical scaling and asymptotic limit argmu- 
ments. We therefore neglected prefactors and slowly varying corrections, i.e., finite size 
effects. In order to test our approach as well as to quantify the importance of finite-size 
corrections, it is therefore interesting to perform numerical simulations. In this section, we 
illustrate our purpose on specific examples such as those mentionned in Appendix El As 
already explained, most of these simulations are performed on continuous cascades which 
construction algorithm is described in ref. 
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FIG. 2: "Gumbel plots" of the cumulative probability distribution of the maximum of ln(M) at 
scale r = T2~ n , P(x,n) (also referred to as P(x,t)) for different values of x- One sees that as % 
increases from to 1.5, (7* iX , the slope in the tail increases significantly. 

A. Extreme value statistics 

In order to check the results of section IIVI we have generated iV = 5000 independent re- 
alisations of log-normal continuous cascades of intermittency parameter A 2 = 0.2 for various 
values of the parameters iV T and Nt- From these independent samples, we have estimated 
the cdf of the maximum value of ln(M) at scale r = T2~ n . In Fig. Q^a) the cdf P(x,n) 
(also referred to as P(x, r)) is plotted for Nt = 1 and T/r = 8, 16, 32, 64. According to Eqs. 
(j2IIJ) and (|3U|). we expect to observe, as the scale ln(r/T) decreases, a front traveling towards 
negative x at velocity a*, x =o — «*■ This behavior is well verified and can be quantitatively 
checked in Fig. db) where all the fronts merge when plotted versus x — a*ln(r/T) with 
a* ~ 0.47 as given according to Eqs. (f4*Uj) and (Q): 

a* = 1 + y - \VW+X) ■ 

In Fig. [TJc) the same analysis is performed for Nt = T/t and therefore x — 1- O ne 
observes that the velocity decreases as expected (in that case a*, x =i — 0.2). For x large 
enough, a* iX can become negative as illustrated in Fig. QJc) where we have reproduce the 
plot of Fig. Ufa) for r = 8, 16, 32 and x — 3. According to previous formula, a*, x =3 — —0.16, 
a value compatible with observations where one sees the front moving towards positive x 
values as the scale decreases. The fact that a* >x is negative for x large enough can be easily 
understood as follows: because the measure is continuous, as r decreases, the maximum is 
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expected to go to zero, but if in the same time, the number of indenpendent integral scales 
is increased, the maximum is expected to increase. The exponent \ controls the balance 
between these two opposite effects. For \ large enough one explores negative dimensions 
that can be associated with negative a's. Notice that in all plots one observes a slight 
change in the shape of the front as the scale goes to zero: this is not surprising because the 
asymptotic shape of the front depends on \ and is a priori different from the initial front at 
scale T/r = 8 considered in Fig. [TJ 

According to Eq. ([33)1. the asymptotic shape of the fronts should be Gumbel, i.e., 



Notice that q* jX is an increasing function of x- in Fig. |2l we have plotted ln(— \n(P(x)) versus 
x for various values of T/r = N T and Nt- We have chosen the values (N T = 512, Nj< = 1), 
(N T = 512, Nt = 8), (N T = 512, Nt = 64) and (N T = 64, Nt = 512) that correspond 
respectively (if one neglects prefactors in our analysis) to x — 0, x — 0.33, x — 0.67, 
X = 1.5. For an exact Gumbel law, one would expect a straight line. One clearly observes 
strong deviations to the Gumbel shape because the asymptotic regime is not reached but 
the tails look straight and are very well estimated by the theoretical g* iX values associated 
with the values of x~- one sees a systematic increase of g* iX when one goes from x = to 



B. Tail exponent estimators 

Let us now check our results on the tail behavior of the estimated pdf from measure 
samples. 

Let us first compute fi UtX as given by Eq. (|4*Hjl . i.e. solve Eqs. (|4*3^) and (|4*ljl for respectively 
log-Normal, log-Poisson and log-Gamma examples. After some simple algebra, we find in 
the log- normal case 




with parameter g* jX for the log-normal model: 




X = 1.5. 




(46) 
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FIG. 3: Rank-Frequency (Zipf) plots in log-log representation for a log-Normal cascade (top) and 
a log-Gamma cascade (bottom). The slope of the right linear part provides an estimate of the tail 
exponent fj,. Dashed lines indicate analytical expectations (see text). 

Let us notice that for v = 0, one recovers previous tail exponent of the extreme values 
A(O)X) = Q*,x- I* 1 t ne case of log-Poisson statistics (Eq. (|A3|l ). the computation leads to 
an expression for (x(y, x) that involves Lambert W(x) function. A pertubation series in the 
limit 5 — > of this expression gives: 



fa P {v, x) 



2(1 + x )(l -j/) + 2J (l + x)(^-l)-l + 



A 2 



3A 2 



(47) 



Along the same line, the value of fi(u, x) can be computed in the case of the log-gamma 
cascade (Eq. ([ A4[) ) which again involves the second branch of the Lambert function W-\\ 



1+W- 



1-e 



\ , (i+x)(i-") 



- e " 1+ »fi 



(48) 



A series expansion in the limit (3 — > +oo gives: 



2(1 + *)(!-!/) 



u) | 4 (z/-l)(l + x) | 



(49) 



A 2 3(3X 2 
In Fig. |3J the rank ordering of the a log-normal and a log-Gamma (with (3 — 4) cascade 
processes with an intermittency coefficient A 2 = 0.2 are plotted in doubly logarithmic scale 



21 



o 







(a) 




♦\ 

\ 








□ 

o 


° ?T^8 — | 


'i 


— 1- 


1 'l 1 'l 
1,1 


■ 'l ■ H . H , 


(b) 

- 




0.1 


0.2 3 0.3 

A 


0.4 



FIG. 4: Mean estimated tail exponent as a function of the intermittency coefficient A 2 for contin- 
uous log-normal cascades. Each mean value is computed using 1000 cascade samples of length L. 
(a) Power-law fit estimator as a function of A 2 for T = 1024, L = 4096, k = 16 (•), k = 32 (o), 
k = 64 (A), k = 128 (□) and T = 512, L = 8192, fc = 32 (a). The continuous line corresponds 
to the theoretical prediction [i[ n = y/2/ A 2 (Eq. (|46j0 . (b) The same as in (a) but each curve has 
been rescaled by a factor y/2(l — v)(l + x) according to Eq. ijlHjl (see text). 

(such plots are often referred to as "rank-frequency" plots or "Zipf" plots 0,0]). One clearly 
sees that the rightmost part of each distribution behaves as a power law. From Eqs. (|4fijl 
and (|4"%|) (with \ = and v — 0), the slope of the plots should be respectively \i ~ 3.1 and 
/i ~ 2.0. One sees that these behaviors, reported on the figures as dashed lines, fit the data 
relatively well. One can observe that the the scaling range associated with the log-Gamma 
cascade is wider than for the log-Normal measure. Indeed, according to Appendix [Dl (see 
Eq. (jDlJl ). in the log-Normal case, this range should be around 0.2pg^ ln(T/r) ~ 0.7 for 
p = 0.1 while its values for log-Gamma is expected to be 1.13pg*ln(T/r) — 2 for p = 0.1, 
i.e., more than two times wider than for the log- Normal case. This difference can be visually 
checked in Fig. El 

In Fig. HI we have plotted the mean value of the tail exponent estimator (power-law 
fit estimator) as a function of the intermittency parameter A 2 in the case of a log-Normal 
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FIG. 5: Mean estimated (Power-law fit) tail exponent (o) as a function of f3 for discrete log- 
Gamma cascades with A 2 = 0.2 and T = 2048. The Hill parameter is k = 32 and each mean 
value is computed from 1000 cascade samples. The continuous line corresponds to the analytical 
expression (|4*%|) 



(continuous) cascade for various values of k, and N T . The theoretical prediction ()46|) 
for v = is shown in continuous line for comparison. The mean has been evaluated on 
10 3 independent cascade samples. In Fig. Ufa), one can see that for all parameter values, 
the curves have the same decreasing behavior. All these curves collapse on the theoretical 
prediction a/2/A 2 as shown in Fig. 0Jb), if one rescales each one by a factor \J [1 — v){\ + x) 
where v and x are computed from expressions (|41jl and ()27j1 by assuming that the prefactors 
are trivially 1. Empirically we find that the asymptotic phenomenology works quite well and 
prefactors or slowly varying behavior are negligible. As previously emphazised, the exact 
computation of prefactor values and finite size effects is beyond the scope of this paper and 
should involve more sophisticated mathematical tools (see appendix 

In Fig. El we compare the expression (|48jl to the behavior of the tail exponent estimator 
as a function of f3 in the case of a discrete log-Gamma cascade: in that case, A 2 is fixed 
(A 2 = 0.2) while (3 varies. Once again, we can see that if we take into account finite k value 
through the value of v and x close their expected values, the analytical expression provides 
a very good fit of the data. 

We have therefore illustrated, on two specific examples, that the phenomenology devel- 
oped in previous section allows us to predict with a relative precision the tail behavior of 
multifractal measure samples. Let us now show how financial time series fluctuations can 
be described within this framework. 
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VII. APPLICATION TO FINANCIAL DATA 



Multifractal models are, with many regards, well suited to account for return fluctuation 
of financial assets 0, 3]. Among the "stylized facts" characterizing the asset return fluctu- 
ations, the phenomenon of "volatility clustering" (called heteroskedaticity in econometrics) 



is the most important one. One of the key points raised in refs. 
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351 ] is that these 



volatility correlations are found empirically very close to the "log-correlations" of continu- 



ous multifractal cascades 



24j . Therefore the observed multiscaling properties of returns can 



be simply explained in terms of volatility persistence. In this section we want to stress that, 
for the same reason, namely the logaritmic shape of log-volatility correlations, the pdf of 
return fluctuations appear empirically as fat tailed with a rather small tail exponent. 

we have shown that a parcimonious model of X(t), some asset return 



In ref. 



22, 



value at time t, can be constructed as follows: 



X(t) = B(M(t)) 



(50) 



where B(t) is the standard Brownian motion and M(t) is a multifractal continuous cascade 



as defined in refs. 



9] 



M{t) = lim / e Mt) dt 



The process 07 (i) plays exactly the same role as u = ln(W) in discrete cascades. It is easy 
to see that the variations of M(t) in ()50|) can be interpreted as a stochastic variance. This 
quantilty is called the "volatility" in finance 3^. 

If M(t) has multiscaling (e.g. log-normal) properties then so do X(t). Empirically, 
it has been determined using data from several markets, from various countries, that the 
intermittency coefficient charaterizing the multifractal statistics of the volatility is close to 
A 2 = 0.2 while the integral scale T is typically around 1 year 22]. On the other hand, many 
studies relying on high frequency data or on thousands of daily stock returns, have revealed 
that the financial return pdf have heavy tails with a tail exponent [i in the interval [3,5]. 



34, 



361 ] . This observation 



This is the famous "inverse cubic" law for return fluctuations 
lead to one of the main objections raised against the previous multifractal model for asset 
returns t 34] . Indeed, the unconditional pdf of the volatility associated with a log- normal 
multifractal cascades of coefficient A 2 = 0.15 has a tail exponent \i ~ 13 (Eq. (|A2|) ). Within 
the "subordinated" model (]STj|) . this would mean that the tail of the volatity pdf is around 
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FIG. 6: Rank-Frequency plot of CAC40 daily volatility estimates (•) as compared to simililar plot 
for a log-Gamma continuous cascade with T = 253 days, A 2 = 0.05 and = 4 (thin line). The fit 
of the extreme tail provides an estimation fx ~ 2 (dashed line). 

2/i « 26, i.e., close to ten times the observed value ! However, the main message of this 
paper is that for multifractal fluctuations, the observed extreme events are far from being 
distributed as they were independent. In particular we have shown in section IS that the 
estimators of the pdf tail exponent strongly depend on v and x an d are "generically" smaller 
than fi. Typically, in finance r ~ 10~ 2 — 1 day, T ~ 1 — 2 years and L ~ 10 years. Therefore, 
a rough estimate of x an d v can be x — v — 0-5- If one uses these values in Eq. (|4*r)|) . 
one finds a typical value for the estimated tail within the log-normal model that is 2/2 ~ 6. 
This is a value closer to the observations. In order to have a better fit of the tail behavior 
one could use a log-gamma model with the same intermittency coefficient and (3 = 4 (see 
Appendix [XJ. In that case, Eq. ()48|) gives an estimator value 2/2 ~ 3.6 that agrees with 
observations. 

In Fig. IHlis reported a rank-frequency plot of estimated daily volatilities associated with 
the 40 stock values composing to the French CAC 40 index. The data are daily 'open' 'high' 
'low' 'close' quotes over a mean period of 10 years. The daily volatilities are estimated 



using the widely used Garman-Klass method 



and each volatility sample mean has been 



normalized to 1. For comparison the Zipf plot associated with a multifractal log-Gamma 
measure with T = 1 year, A 2 = 0.2 and (3 = 4 has been also reported. One can see that 
both curves behave very similarily with a power-law tail exponent fj, ~ 2. Let us note 
that for small volatilites values, the behavior of CAC40 volatility pdf is slightly different 
from the log-Gamma cascade probably because of the hight frequency noise in the Garman- 
Klass volatility estimates. This figure illustrates very well our result: there is no discrepency 
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between the value of the intermittency coefficient and the estimated pdf power-law behavior. 



VIII. SUMMARY AND PROSPECTS 

In this paper we have addressed the problem of extreme value statistics for multifractal 
processes. This problem is non trivial and possesses a rich phenomenology involving non 
ergodic behavior. In the case of multifractal processes, two important parameters govern 
the asymptotics: the overall sample lenght and the scale at which data are sampled. The 
exponent % we have introduced, precisely quantifies how one defines the asymptotic limit 
as one changes these two parameters. The observed extreme value statistics result from 
a "competition" between an increase of the number of independent samples (which tends 
to increase typical extreme values) and a decrease of the sampling scale (which tends to 
decrease typical observed values). Consequently, the law of extremes continuously depends 
on x, an exponent that turns out to be interpreted as a negative dimension. 

This exponent naturally plays an important role when one wants to estimate the tail of 
the probability law associated with multifractal fluctuations. Using the phenomenology of 
the multifractal formalism we have shown that tail exponent estimators continuously depend 
on x- They also depend on another exponent v that quantifies "how many" extremes one 
uses for estimation. Under usual experimental conditions whe have notably shown that the 
obtained exponent is usually smaller than the exponent expected from the unconditional 



law. Such non ergodic behavior are similar to those observed in the 
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;hermodynamics of 



27]. 



disordered systems, at low temperature, under the freezing transition 

The argmuments and methods we used in this paper are mostly phenomenological and rely 
upon large deviation type arguments, multifractal formalism and traveling front solution of 
non linear iteration equations. Beyond the need for sitting it on rigorous bases, they are other 
appealing mathematical prospects raised by our appoach, such as the possibility to address 
finite size effects in multifractal scaling laws or to define a precise statistical framework for 
studying them. 

As far as applications are concerned, we have provided a direct use of our results in the 
field of econophysics where multifractal models for asset returns are popular. Whe have 
shown that the observed fat tails of return pdf are well reproduced by a multifractal model 
designed to account for the volatility clustering phenomenon. Other fields where multifractal 
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processes are involved can be potentially investigated along the same line. Conversely and 
perhaps more importanlty, within this framework, mult ifr act ality appears as an alternative 
ihai^can invoked to explain the origin of fat tails as observed in many fields of applied science 

3, 
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APPENDIX A: LOG-NORMAL, LOG-POISSON AND LOG-GAMMA MULTI- 
FRACTAL MEASURES 

In this appendix we provide 3 examples of multifractal statistics to which we will refer 
all along the paper. We consider the 3 infinitely divisible laws of InW (or equivalently u): 
normal, Gamma and Poisson. 

In the simplest case, u = InW is normal of variance A 2 /ln(2) (and mean — A 2 /21n(2) 
because E(W) = 1). Then E [W q ] = e -i x2 / 2 + x ' 2 i 2 / 2 an d we recover the expression of Eq. (J7|): 

C ln (g)=g(l + A 2 /2)-AV/2 (Al) 
The intermittency coefficient in this case is simply A 2 . If one solves ((q) = 1, q > 1 one gets 

V = Jp (A2) 
In the second example, n is a Poisson random variable of intensity 7 ln(2) and u = mo ln(2) + 
nd. Then q - ln 2 E [e quJ ] = q(l - m ) + 7(1 - e qS ). If one sets C(l) = 1 and -("(0) = A 2 , one 
obtains the spectrum: 

Clp(?)=?(l + - 1)) + ^ (1 - (A3) 

Notice that when <5 — ^ 0, — > £j n . It is easy to show that, when 5 is negative and small 
enough, for all q > 1, C]p(<z) > 1 an d therefore \i = +00 in that case. 
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In the third example, uo is Gamma distributed: if x is a random variable of pdf 
/3 ain(2) a ,ain(2)-i e -^/r(o!ln(2)) and u = x + m ln(2), then q - ln 2 E[e q ") = q(l - m ) + 
aln(l - q//3). By setting ((1) = 1 and -("(0) = A 2 , we have: 

C]g(?) = Q (l - A 2 /? 2 H^)) + A 2 /5 2 In f^j^j (A4) 

Notice that C[g — ► Cl n wnen /3 — > +oo. The solutions of C\g(o) = 1 can be obtained in 
terms of Lambert W function (satisfying W(x)e w ^ = x) and therefore the value of jx can 
be exactly computed as a function of A 2 and (3. 



APPENDIX B: PROOF OF EOJ^Hl 

In this appendix we prove Eq. ()28jl. Let us first study how the law of the maximum 
of ln(M(J n )/r) inside one integral scale varies when on changes the scale r = T2~ n+1 to 
t = T2~ n . Let I n (k), k = . . . 2 n — 1 denote the dyadic intervals of size T2~ n of the interval 
[0, T]. From the cascade construction, Vn, the following stochastic equality can be easily 
proven: 

M[I n (k)} ^J^M^I^k)) , k e [O^"- 1 ) 

fad Z 

M [I n {k)\ = ^M 2 (I n _ 1 (k - 2"" 1 )) , k G [2 n -\ 2 n ] 

fdd Z 

where Wi, W 2 , Mi and M 2 are independent copies of W and M respectively. The symbol 
fdd means an equality in law for all finite dimensional distributions. Therefore, if 

X(n) = max[ln(M(7 n (A;))/T)] 

k 

One has 

X{n) = max [ln(Wi) + X 1 {n - 1), ln(W 2 ) + X 2 (n - 1)] 

law 

If g(x) denote the law of u = ln(W) and P'(x, n) = P [X(n) > x], the previous equality can 
be rewritten as 

-i 2 



P'(x,n) 



J P'(z — x, n — l)g(z) dz 
ig*P'(x,n-l)} 2 



Now if one has = 2 rm > 1 integral scales , the cdf of the maximum is given by 
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APPENDIX C: ARONSON- WEINBERGER CRITERIUM AND FINITE SIZE 
EFFECTS ON FRONT SOLUTIONS OF EQ. flggp 



In this Appendix we provide some additional technical details on the solutions of 



(128)1 . We do not establish rigorous proofs but mostly recast some results from refs. 



<ESI 
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32, 



28, 
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331 ] to our problem. If one linearizes Eq. (}2*8*j) in the tail region x — > +oo, one 



obtains the following recursion for = 1 — P(x,m): 

Q(x,m + 1) = 2 p+r Q{x,m)*g ip \x) (CI) 

where g( p \x) is simply the product of p convolutions g(x) * . . . * g(x). If one decomposes 
Q(x,rn) on Fourier modes: 

Q(k,m) = J e~ tkx Q(x,m) dx 

then (101)1 becomes: 

m + 1) = 2 p+r Q(A;, m)e pF(ifc) 

where F{ik) = InE [e _jfcw ] is the cumulant generating function of u = ln(W) the logarithm 
of cascade weights. The solution is therefore 

Q(k,m) = A (k)e ml ^ r+P ^ +F ^ ln ^ 

where A(k) is simply the Fourier transform of the initial condition. Q(x,m) is obtained as 
the inverse Fourier transform: 

Q(x,m) = (27T)- 1 / e ^ A ^ e rnH2)[r +P (l + Fm/ln(2))] ^ 



In a referential moving at velocity v, (i.e. x m = xq + vm), the previous integral can 
be computed using a steepest descent method: One deforms the integral over the real axis 
to a contour in the complex plane of constant phase. In the limit m — > +oo, the main 
contribution comes from saddle point of the function F(ik) along this path: 

where satisfies: 

dF(ik) l 
* V = dk =k * 
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Moreover, if the front is stationary in the moving frame, the selected velocity should be such 
that the real part of the exponent is zero: 

F(iK) 



3? < ik+mv + mln(2) 



r +p(l + 



ln(2) 'J 

and thus, if v is real, by setting k = iq, and thanks to the equality 

F(-q)=ln(2)(q-((q)) 
one can rewrite respectively Eqs. (jC3|) and (jC2|) as follows: 



(C3) 



ln(2)[r+p(l+ g -C(<?))] 



dv(q) I 

~dq~\ q=q *>x ~~ 







The first equation is the dispersion relationship (J3"3|) while the second is the standard 
Aronson- Weinberger criterium stating that the selected velocity is the minimum velocity, 
a velocity that corresponds to the marginally stable solution (Eq. (|35|)). 

Notice that previous argument assumes that both F(ik) and A(k) are analytical functions. 
If the initial condition decreases exponentially, i.e., A(k) has a complex pole which imaginary 
part is /i (as in Eq. (J3UJ)), then, as discussed in j29|, g* x must be replaced by min(^ jX ,/i). 
This yields Eq. ()36|) . As far as F(ik) is concerned, it is easy to show that it is analytical in 
a strip around the real axis {z, Q(z) < ji}. 

Let us finally remark that a general solution traveling at velocity v of Eq. (|C1|) can be 
written as: 

Q(z m ) = A ie - q ^ + A 2 e~ q2Z ™ 

where z m = zq + vm and q\ and q 2 are the two complex conjugated roots of the dispersion 
relation (jMj) . When v = v(q* a ), the selected minimum velocity, these two roots merge and 



generically the solution behaves has 7, 
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30j: 



Q{z m ) = (Az m + B)e 

We see that the asymptotic shape of the front is not precisely Gumbel but has a subdominant 
correction factor. 

Notice that the matching of the previous functional shape and the shape obtained in 
the previous saddle point analysis, when one accounts for the Gaussian corrections in the 
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integration around the saddle point, leads to famous Bramson logarithmic correction to the 



front velocity 



17 



3 

x m = mv(q^ x ) - - ln(m) 



This "universal" logarithmic correction to the velocity is well known for solutions of KPP 
equation. A specific analysis of these finite-size corrections to scaling is beyond the scope of 
this paper and will be reported in a forthcoming study 

APPENDIX D: SCALING RANGE FOR TAIL ESTIMATION 

One question that naturally arises from the analysis made in section IS is the question of 
stability of ji{v, x) 

function of v (and thus of k) and x- This question is linked 
to the question of the scaling range associated with the log- log representation of Eq. (j2J). 
This problem is important for pratictal purpose. Let p = Aq/q be the precision above which 
one can detect a tail exponent variation (for the sake of simplicity we will consider p ~ 0.1) 
Let ln(S') be the scaling range over which one observes the power-law. According to our 
description, this corresponds to a variation of the Holder exponent, i.e., 

\n(S) ~ Aaln(T/r) 

This variation corresponds therefore to a variation of q that is 

dq Aa 
Aq ~ Aa — — = 

and finally, if the scaling range is such that Aq = pq, one obtains 

\n(S)=pq u , x HT/r)C(q^ x ) (Dl) 

We see that this scaling range is related to the global scaling range rescaled by the factor 
that involves the intermittency coefficient evaluated at the value q = q VjX . 
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